Accelerating Electron Tomography Reconstruction Algorithm ICON Using the Intel Xeon Phi Coprocessor on Tianhe-2 Supercomputer
نویسندگان
چکیده
Electron tomography (ET) is an important method for studying three-dimensional cell ultrastructure. Combining with a subvolume averaging approach, ET provides new possibilities for investigating in situ macromolecular complexes in sub-nanometer resolution. Because of the limited sampling angles, ET reconstruction usually suffers from the ‘missing wedge’ problem. With a validation procedure, Iterative Compressed-sensing Optimized NUFFT reconstruction (ICON) demonstrates its power in the restoration of validated missing information for low SNR biological ET dataset. However, the huge computational demand has become a bottleneck for the application of ICON. In this work, we developed the strategies of parallelization for NUFFT and ICON, and then implemented them on a Xeon Phi 31SP coprocessor to generate the parallel program ICON-MIC. We also proposed a hybrid task allocation strategy and extended ICON-MIC on multiple Xeon Phi cards on Tianhe-2 supercomputer to generate program ICON-MULT-MIC. With high accuracy, ICON-MIC has a significant acceleration compared to the CPU version, up to 13.3x, and ICON-MULT-MIC has good weak and strong scalability efficiency on Tianhe-2 supercomputer.
منابع مشابه
Optimization of Binomial Option Pricing on Intel MIC Heterogeneous System
In these years, computerization has been more and more important in the financial area. The computational intensity and realtime constraints of those financial models require high-throughput parallel architectures. In this paper, optimization of widely-used binomial option pricing model has been implemented on the worlds largest supercomputer, Tianhe-2. In our work, we employ several optimizing...
متن کاملFirst experiences with the Intel MIC architecture at LRZ
With the rapidly growing demand for computing power new accelerator based architectures have entered the world of high performance computing since around 5 years. In particular GPGPUs have recently become very popular, however programming GPGPUs using programming languages like CUDA or OpenCL is cumbersome and errorprone. Trying to overcome these difficulties, Intel developed their own Many Int...
متن کاملAccelerating DNA Sequence Analysis using Intel Xeon Phi
Genetic information is increasing exponentially, doubling every 18 months. Analyzing this information within a reasonable amount of time requires parallel computing resources. While considerable research has addressed DNA analysis using GPUs, so far not much attention has been paid to the Intel Xeon Phi coprocessor. In this paper we present an algorithm for large-scale DNA analysis that exploit...
متن کاملScaling up Hartree-Fock calculations on Tianhe-2
This paper presents a new optimized and scalable code for Hartree–Fock self-consistent field iterations. Goals of the code design include scalability to large numbers of nodes, and the capability to simultaneously use CPUs and Intel Xeon Phi coprocessors. Issues we encountered as we optimized and scaled up the code on Tianhe-2 are described and addressed. A major issue is load balance, which is...
متن کاملTowards simulation of subcellular calcium dynamics at nanometre resolution
Numerical simulation of subcellular Ca2þ dynamics with a resolution down to one nanometre can be an important tool for discovering the physiological cause of many heart diseases. The requirement of enormous computational power, however, has made such simulations prohibitive so far. By using up to 12,288 Intel Xeon Phi 31S1P coprocessors on the new hybrid cluster Tianhe-2, which is the new numbe...
متن کامل